Compact High Speed Sense Amplifier for Non-Volatile Memory and Hybrid Lockout

ABSTRACT

A compact and versatile high speed sense amplifier suitable for use in non-volatile memory circuits is presented. The sense amp circuit uses one power supply level for the bit line driving path and a second supply level for a data latch of the sense amp. The latch&#39;s supply level is of a high level that used for driving the bit lines and can be provided by a charge pump. The sense amp need use only NMOS devices for its analog path. For balancing performance and current consumption, the sense amp also includes an additional latch to support a “hybrid lockout” sensing mode, where in a verify operation a read-lockout is used between different data states, but not between the low and high quick pass write (QPW) verifies.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Continuation-in-Part of U.S. patent applicationSer. No. 13/605,424, filed on Sep. 6, 2012, which is in turn aContinuation-in-Part of U.S. patent application Ser. No. 13/536,758,filed on Jun. 28, 2012.

FIELD OF THE INVENTION

This invention relates generally to non-volatile semiconductor memorysuch as electrically erasable programmable read-only memory (EEPROM) andflash EEPROM, and specifically to sensing circuits for such memories.

BACKGROUND OF THE INVENTION

Solid-state memory capable of nonvolatile storage of charge,particularly in the form of EEPROM and flash EEPROM packaged as a smallform factor card, has become the storage of choice in a variety ofmobile and handheld devices, notably information appliances and consumerelectronics products. Unlike RAM (random access memory) that is alsosolid-state memory, flash memory is non-volatile and retains its storeddata even after power is turned off. In spite of the higher cost, flashmemory is increasingly being used in mass storage applications.Conventional mass storage, based on rotating magnetic medium such ashard drives and floppy disks, is unsuitable for the mobile and handheldenvironment. This is because disk drives tend to be bulky, are prone tomechanical failure and have high latency and high power requirements.These undesirable attributes make disk-based storage impractical in mostmobile and portable applications. On the other hand, flash memory, bothembedded and in the form of a removable card, are ideally suited in themobile and handheld environment because of its small size, low powerconsumption, high speed and high reliability features.

EEPROM and electrically programmable read-only memory (EPROM) arenon-volatile memory that can be erased and have new data written or“programmed” into their memory cells. Both utilize a floating(unconnected) conductive gate, in a field effect transistor structure,positioned over a channel region in a semiconductor substrate, betweensource and drain regions. A control gate is then provided over thefloating gate. The threshold voltage characteristic of the transistor iscontrolled by the amount of charge that is retained on the floatinggate. That is, for a given level of charge on the floating gate, thereis a corresponding voltage (threshold) that must be applied to thecontrol gate before the transistor is turned “on” to permit conductionbetween its source and drain regions.

The floating gate can hold a range of charges and therefore can beprogrammed to any threshold voltage level within a threshold voltagewindow (also referred to as a “conduction window”). The size of thethreshold voltage window is delimited by the minimum and maximumthreshold levels of the device, which in turn correspond to the range ofthe charges that can be programmed onto the floating gate. The thresholdwindow generally depends on the memory device's characteristics,operating conditions and history. Each distinct, resolvable thresholdvoltage level range within the window may, in principle, be used todesignate a definite memory state of the cell. When the thresholdvoltage is partitioned into two distinct regions, each memory cell willbe able to store one bit of data. Similarly, when the threshold voltagewindow is partitioned into more than two distinct regions, each memorycell will be able to store more than one bit of data.

In a two-state EEPROM cell, at least one current breakpoint level isestablished so as to partition the conduction window into two regions.When a cell is read by applying predetermined, fixed voltages, itssource/drain current is resolved into a memory state by comparing withthe breakpoint level (or reference current IREF). If the current read ishigher than that of the breakpoint level, the cell is determined to bein one logical state (e.g., a “zero” state). On the other hand, if thecurrent is less than that of the breakpoint level, the cell isdetermined to be in the other logical state (e.g., a “one” state). Thus,such a two-state cell stores one bit of digital information. A referencecurrent source, which may be externally programmable, is often providedas part of a memory system to generate the breakpoint level current.

In order to increase memory capacity, flash EEPROM devices are beingfabricated with higher and higher density as the state of thesemiconductor technology advances. Another method for increasing storagecapacity is to have each memory cell store more than two states.

For a multi-state or multi-level EEPROM memory cell, the conductionwindow is partitioned into more than two regions by more than onebreakpoint such that each cell is capable of storing more than one bitof data. The information that a given EEPROM array can store is thusincreased with the number of states that each cell can store. EEPROM orflash EEPROM with multi-state or multi-level memory cells have beendescribed in U.S. Pat. No, 5,172,338.

The transistor serving as a memory cell is typically programmed to a“programmed” state by one of two mechanisms. In “hot electroninjection,” a high voltage applied to the drain accelerates electronsacross the substrate channel region. At the same time a high voltageapplied to the control gate pulls the hot electrons through a thin gatedielectric onto the floating gate. In “tunneling injection,” a highvoltage is applied to the control gate relative to the substrate. Inthis way, electrons are pulled from the substrate to the interveningfloating gate.

The memory device may be erased by a number of mechanisms. For EPROM,the memory is bulk erasable by removing the charge from the floatinggate by ultraviolet radiation. For EEPROM, a memory cell is electricallyerasable, by applying a high voltage to the substrate relative to thecontrol gate so as to induce electrons in the floating gate to tunnelthrough a thin oxide to the substrate channel region (i.e.,Fowler-Nordheim tunneling.) Typically, the EEPROM is erasable byte bybyte. For flash EEPROM, the memory is electrically erasable either allat once or one or more blocks at a time, where a block may consist of512 bytes or more of memory.

The memory devices typically comprise one or more memory chips that maybe mounted on a card. Each memory chip comprises an array of memorycells supported by peripheral circuits such as decoders and erase, writeand read circuits. The more sophisticated memory devices operate with anexternal memory controller that performs intelligent and higher levelmemory operations and interfacing.

There are many commercially successful non-volatile solid-state memorydevices being used today. These memory devices may be flash EEPROM ormay employ other types of nonvolatile memory cells. Examples of flashmemory and systems and methods of manufacturing them are given in U.S.Pat. Nos. 5,070,032, 5,095,344, 5,315,541, 5,343,063, and 5,661,053,5,313,421 and 6,222,762. In particular, flash memory devices with NANDstring structures are described in U.S. Pat. Nos. 5,570,315, 5,903,495,6,046,935.

Nonvolatile memory devices are also manufactured from memory cells witha dielectric layer for storing charge. Instead of the conductivefloating gate elements described earlier, a dielectric layer is used.Such memory devices utilizing dielectric storage element have beendescribed by Eitan et al., “NROM: A Novel Localized Trapping, 2-BitNonvolatile Memory Cell,” IEEE Electron Device Letters, vol. 21, no. 11,November 2000, pp. 543-545. An ONO dielectric layer extends across thechannel between source and drain diffusions. The charge for one data bitis localized in the dielectric layer adjacent to the drain, and thecharge for the other data bit is localized in the dielectric layeradjacent to the source. For example, U.S. Pat. Nos. 5,768,192 and6,011,725 disclose a nonvolatile memory cell having a trappingdielectric sandwiched between two silicon dioxide layers. Multi-statedata storage is implemented by separately reading the binary states ofthe spatially separated charge storage regions within the dielectric.

Programming a page of memory cells typically involves a series ofalternating program/verify cycles. Each program cycle has the page ofmemory cells subject to one or more programming voltage pulses. Theprogram cycle is followed by a verify cycle in which each cell is readback to deteitiiine if it has been programmed correctly. Those cellsthat have been verified will be program-inhibited from subsequentprogramming pulses. The program/verify cycles continue with increasingprogramming voltage level until all cells in the page have beenprogram-verified.

Both reading and verifying operations are performed by executing one ormore sensing cycle in which the conduction current or threshold voltageof each memory cell of the page is determined relative to a demarcationvalue. In general, if the memory is partitioned into n states, therewill be at least n-1 sensing cycles to resolve all possible memorystates. In many implementations each sensing cycle may also involve twoor more passes. For example, when the memory cells are closely packed,interactions between neighboring charge storage elements becomesignificant and some sensing techniques involve sensing memory cells onneighboring word lines in order to compensate for errors caused by theseinteractions.

In order to improve read and program performance, multiple chargestorage elements or memory transistors in an array are read orprogrammed in parallel. Thus, a “page” of memory elements are read orprogrammed together. In existing memory architectures, a row typicallycontains several interleaved pages or it may constitute one page ofcontiguous memory cells. All memory elements of a page will be read orprogrammed together. In currently produced semiconducting integratedcircuit memory chips, a memory page may have as many as 64,000 memorycells or memory elements being read or sensed in parallel.

There is an ongoing need for increased performance. Additionally, themassively parallel memory page presents significant issues of noise andinterference among the closely packed memory cells and structures thatlimit sensing accuracy and ultimately performance and storage capacity.

Therefore there is a general need for high capacity and high performancenon-volatile memory. In particular, there is a need for sensing circuitsof increased speed and less noise.

SUMMARY OF INVENTION

In a first set of aspects, a sense amplifier for a memory circuitincludes first and second latch circuits, an intermediate circuit, andbit line selection circuitry. The intermediate circuit intermediatecircuit includes a first node selectively connectable to one or more bitlines. The bit line selection circuitry is connected to the first node,whereby the first node can selectively be connected to one or more bitlines. The first node can be connected to a first supply level forpre-charging of the first node for a sensing operation by a pre-chargeswitch. The first latch circuit is connectable to the intermediatecircuit to have a value latched in it set according the level on thefirst node. The second latch circuit is connectable to the intermediatecircuit to have a value latched in it set according the level on thefirst node. The values latched in the first and second latch circuitscan respectively be transferred to a data bus by first and secondswitches. In a sensing operation, after pre-charging the first node andprior to a subsequent pre-charging, the first and second data latchcircuits can sequentially be connected to have the value latched thereinset according to the level of the first node.

According to another set of aspects, a sense amplifier for a memorycircuit includes a latch circuit and an intermediate circuit. Theintermediate circuit includes a first node selectively connectable toone or more bit lines. The latch circuit is selectively connectable tothe first node. The first latch circuit can be connected to a data busby a first switch. The first node is selectively connectable to a firstvoltage supply level by a second switch. The first node is selectivelyconnectable to an external node by a third switch, where the pathbetween the external node and the first node is formed of only n-typedevices. When a value held in the first latch circuit is at a high valuethe first node is cut off from the first voltage supply level, and whenthe value held in the first latch circuit is at a low value the firstnode is cut off from the external node.

Various aspects, advantages, features and embodiments of the presentinvention are included in the following description of exemplaryexamples thereof, which description should be taken in conjunction withthe accompanying drawings. All patents, patent applications, articles,other publications, documents and things referenced herein are herebyincorporated herein by this reference in their entirety for allpurposes. To the extent of any inconsistency or conflict in thedefinition or use of terms between any of the incorporated publications,documents or things and the present application, those of the presentapplication shall prevail.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates schematically the functional blocks of a non-volatilememory chip in which the present invention may be implemented.

FIG. 2 illustrates schematically a non-volatile memory cell.

FIG. 3 illustrates the relation between the source-drain current I_(D)and the control gate voltage V_(CG) for four different charges Q1-Q4that the floating gate may be selectively storing at any one time.

FIG. 4 illustrates an example of an NOR array of memory cells.

FIG. SA illustrates schematically a string of memory cells organizedinto an NAND string.

FIG. 5B illustrates an example of an NAND array of memory cells,constituted from NAND strings such as that shown in FIG. 5A.

FIG. 6 illustrates a typical technique for programming a page of memorycells to a target memory state by a series of alternating program/verifycycles.

FIG. 7(1) illustrates the threshold voltage distributions of an example4-state memory array with an erased state as a ground state “Gr” andprogressively more programmed memory states “A”, “B” and “C”.

FIG. 7(2) illustrates a preferred, 2-bit LM coding to represent the fourpossible memory states shown in FIG. 7(1).

FIG. 8(1) illustrates the threshold voltage distributions of an example8-state memory array.

FIG. 8(2) illustrates a preferred, 3-bit LM coding to represent theeight possible memory states shown in FIG. 8(1).

FIG. 9 illustrates the Read/Write Circuits, shown in FIG. 1, containinga bank of sense modules across an array of memory cells.

FIG. 10 illustrates schematically a preferred organization of the sensemodules shown in FIG. 9.

FIG. 11 illustrates in more detail the read/write stacks shown in FIG.10.

FIG. 12 illustrates schematically an exemplary embodiment for senseamplifier circuit.

FIG. 13 illustrates an example of a sensing operation using the circuitof FIG. 12.

FIG. 14 illustrates an example of a lockout sensing operation using thecircuit of FIG. 12.

FIG. 15 illustrates an example of a quick pass write operation with twoforced values using the circuit of FIG. 12.

FIG. 16 illustrates an example of a quick pass write operation withthree forced values using the circuit of FIG. 12.

FIG. 17 illustrates an example of a floating quick pass write operationusing the circuit of FIG. 12.

FIG. 18 illustrates an example of measuring cell current using anexternal bias voltage using the circuit of FIG. 12.

FIG. 19 illustrates schematically a second exemplary embodiment forsense amplifier circuit.

FIGS. 20A and 20B are respective waveforms for a binary programoperation for the circuits of FIGS. 12 and 19.

FIGS. 21A and 21B are respective waveforms for a binary program verifyoperation for the circuits of FIGS. 12 and 19.

FIGS. 22A and 22B are respective waveforms for a multi-state programverify operation for the circuits of FIGS. 12 and 19.

FIG. 23 illustrates schematically a third exemplary embodiment for senseamplifier circuit.

FIG. 24 illustrates schematically a third exemplary embodiment for senseamplifier circuit.

FIGS. 25A and 25B illustrate some waveforms for different ways ofimplementing a verify operation in quick pass write.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Memory System

FIG. 1 to FIG. 11 illustrate example memory systems in which the variousaspects of the present invention may be implemented.

FIG. 1 illustrates schematically the functional blocks of a non-volatilememory chip in which the present invention may be implemented. Thememory chip 100 includes a two-dimensional array of memory cells 200,control circuitry 210, and peripheral circuits such as decoders,read/write circuits and multiplexers.

The memory array 200 is addressable by word lines via row decoders 230(split into 230A, 230B) and by bit lines via column decoders 260 (splitinto 260A, 260B) (see also FIGS. 4 and 5.) The read/write circuits 270(split into 270A, 270B) allow a page of memory cells to be read orprogrammed in parallel. A data I/O bus 231 is coupled to the read/writecircuits 270.

In a preferred embodiment, a page is constituted from a contiguous rowof memory cells sharing the same word line. In another embodiment, wherea row of memory cells are partitioned into multiple pages, blockmultiplexers 250 (split into 250A and 250B) are provided to multiplexthe read/write circuits 270 to the individual pages. For example, twopages, respectively formed by odd and even columns of memory cells aremultiplexed to the read/write circuits.

FIG. 1 illustrates a preferred arrangement in which access to the memoryarray 200 by the various peripheral circuits is implemented in asymmetric fashion, on opposite sides of the array so that the densitiesof access lines and circuitry on each side are reduced in half. Thus,the row decoder is split into row decoders 230A and 230B and the columndecoder into column decoders 260A and 260B. In the embodiment where arow of memory cells are partitioned into multiple pages, the pagemultiplexer 250 is split into page multiplexers 250A and 250B.Similarly, the read/write circuits 270 are split into read/writecircuits 270A connecting to bit lines from the bottom and read/writecircuits 270B connecting to bit lines from the top of the array 200. Inthis way, the density of the read/write modules, and therefore that ofthe sense modules 380, is essentially reduced by one half.

The control circuitry 110 is an on-chip controller that cooperates withthe read/write circuits 270 to perform memory operations on the memoryarray 200. The control circuitry 110 typically includes a state machine112 and other circuits such as an on-chip address decoder and a powercontrol module (not shown explicitly). The state machine 112 provideschip level control of memory operations. The control circuitry is incommunication with a host via an external memory controller.

The memory array 200 is typically organized as a two-dimensional arrayof memory cells arranged in rows and columns and addressable by wordlines and bit lines. The array can be formed according to an NOR type oran NAND type architecture.

FIG. 2 illustrates schematically a non-volatile memory cell. The memorycell 10 can be implemented by a field-effect transistor having a chargestorage unit 20, such as a floating gate or a dielectric layer. Thememory cell 10 also includes a source 14, a drain 16, and a control gate30.

There are many commercially successful non-volatile solid-state memorydevices being used today. These memory devices may employ differenttypes of memory cells, each type having one or more charge storageelement.

Typical non-volatile memory cells include EEPROM and flash EEPROM.Examples of EEPROM cells and methods of manufacturing them are given inU.S. Pat. No. 5,595,924. Examples of flash EEPROM cells, their uses inmemory systems and methods of manufacturing them are given in U.S. Pat.Nos. 5,070,032, 5,095,344, 5,315,541, 5,343,063, 5,661,053, 5,313,421and 6,222,762. In particular, examples of memory devices with NAND cellstructures are described in U.S. Pat. Nos. 5,570,315, 5,903,495,6,046,935. Also, examples of memory devices utilizing dielectric storageelement have been described by Eitan et al., “NROM: A Novel LocalizedTrapping, 2-Bit Nonvolatile Memory Cell,” IEEE Electron Device Letters,vol. 21, no. 11, November 2000, pp. 543-545, and in U.S. Pat. Nos.5,768,192 and 6,011,725.

In practice, the memory state of a cell is usually read by sensing theconduction current across the source and drain electrodes of the cellwhen a reference voltage is applied to the control gate. Thus, for eachgiven charge on the floating gate of a cell, a corresponding conductioncurrent with respect to a fixed reference control gate voltage may bedetected. Similarly, the range of charge programmable onto the floatinggate defines a corresponding threshold voltage window or a correspondingconduction current window.

Alternatively, instead of detecting the conduction current among apartitioned current window, it is possible to set the threshold voltagefor a given memory state under test at the control gate and detect ifthe conduction current is lower or higher than a threshold current. Inone implementation the detection of the conduction current relative to athreshold current is accomplished by examining the rate the conductioncurrent is discharging through the capacitance of the bit line.

FIG. 3 illustrates the relation between the source-drain current I_(D)and the control gate voltage V_(CG) for four different charges Q1-Q4that the floating gate may be selectively storing at any one time. Thefour solid I_(D) versus V_(CG) curves represent four possible chargelevels that can be programmed on a floating gate of a memory cell,respectively corresponding to four possible memory states. As anexample, the threshold voltage window of a population of cells may rangefrom 0.5V to 3.5V. Eight possible memory states “0”, “1”, “2”, “3”, “4”,“5”, “6” and “7” respectively representing one erased and sevenprogrammed states, may be demarcated by partitioning the thresholdwindow into eight regions in interval of about 0.4V each. For example,if a reference current, IREF of 0.05 uA is used as shown, then the cellprogrammed with Q1 may be considered to be in a memory state “1” sinceits curve intersects with I_(REF) in the region of the threshold windowdemarcated by VCG=0.43V and 0.88V. Similarly, Q4 is in a memory state“5”.

As can be seen from the description above, the more states a memory cellis made to store, the more finely divided is its threshold window. Forexample, a memory device may have memory cells having a threshold windowthat ranges from −1.5V to 5V. This provides a maximum width of 6.5V. Ifthe memory cell is to store 16 states, each state may occupy from 350 mVto 450 mV in the threshold window. This will require higher precision inprogramming and reading operations in order to be able to achieve therequired resolution.

FIG. 4 illustrates an example of an NOR array of memory cells. In thememory array 200, each row of memory cells are connected by theirsources 14 and drains 16 in a daisy-chain manner. This design issometimes referred to as a virtual ground design. The cells 10 in a rowhave their control gates 30 connected to a word line, such as word line42. The cells in a column have their sources and drains respectivelyconnected to selected bit lines, such as bit lines 34 and 36.

FIG. 5A illustrates schematically a string of memory cells organizedinto an NAND string. An NAND string 50 comprises of a series of memorytransistors M1, M2, . . . Mn (e.g., n=4, 8, 16 or higher) daisy-chainedby their sources and drains. A pair of select transistors S1, S2controls the memory transistors chain's connection to the external viathe NAND string's source terminal 54 and drain terminal 56 respectively.In a memory array, when the source select transistor S1 is turned on,the source terminal is coupled to a source line (see FIG. 5B).Similarly, when the drain select transistor S2 is turned on, the drainterminal of the NAND string is coupled to a bit line of the memoryarray. Each memory transistor 10 in the chain acts as a memory cell. Ithas a charge storage element 20 to store a given amount of charge so asto represent an intended memory state. A control gate 30 of each memorytransistor allows control over read and write operations. As will beseen in FIG. 5B, the control gates 30 of corresponding memorytransistors of a row of NAND string are all connected to the same wordline. Similarly, a control gate 32 of each of the select transistors S1,S2 provides control access to the NAND string via its source terminal 54and drain terminal 56 respectively. Likewise, the control gates 32 ofcorresponding select transistors of a row of NAND string are allconnected to the same select line.

When an addressed memory transistor 10 within an NAND string is read oris verified during programming, its control gate 30 is supplied with anappropriate voltage. At the same time, the rest of the non-addressedmemory transistors in the NAND string 50 are fully turned on byapplication of sufficient voltage on their control gates. In this way, aconductive path is effective created from the source of the individualmemory transistor to the source terminal 54 of the NAND string andlikewise for the drain of the individual memory transistor to the drainterminal 56 of the cell. Memory devices with such NAND string structuresare described in U.S. Pat. Nos. 5,570,315, 5,903,495, 6,046,935.

FIG. 5B illustrates an example of an NAND array 200 of memory cells,constituted from NAND strings 50 such as that shown in FIG. 5A. Alongeach column of NAND strings, a bit line such as bit line 36 is coupledto the drain terminal 56 of each NAND string. Along each bank of NANDstrings, a source line such as source line 34 is coupled to the sourceterminals 54 of each NAND string. Also the control gates along a row ofmemory cells in a bank of NAND strings are connected to a word line suchas word line 42. The control gates along a row of select transistors ina bank of NAND strings are connected to a select line such as selectline 44. An entire row of memory cells in a bank of NAND strings can beaddressed by appropriate voltages on the word lines and select lines ofthe bank of NAND strings. When a memory transistor within a NAND stringis being read, the remaining memory transistors in the string are turnedon hard via their associated word lines so that the current flowingthrough the string is essentially dependent upon the level of chargestored in the cell being read.

Program and Verify

FIG. 6 illustrates a typical technique for programming a page of memorycells to a target memory state by a series of alternating program/verifycycles. A programming voltage V_(PGM) is applied to the control gate ofthe memory cell via a coupled word line. The V_(PGM) is a series ofprogramming voltage pulses in the form of a staircase waveform startingfrom an initial voltage level, V_(PGM0). The cell under programming issubject to this series of programming voltage pulses, with an attempteach time to add incremental charges to the floating gate. In betweenprogramming pulses, the cell is read back or verified to determine itssource-drain current relative to a breakpoint level. The read backprocess may involve one or more sensing operation. Programming stops forthe cell when it has been verified to reach the target state. Theprogramming pulse train used may have increasing period or amplitude inorder to counteract the accumulating electrons programmed into thecharge storage unit of the memory cell. Programming circuits generallyapply a series of programming pulses to a selected word line. In thisway, a page of memory cells whose control gates are coupled to the wordline can be programmed together. Whenever a memory cell of the page hasbeen programmed to its target state, it is program-inhibited while theother cells continue to be subject to programming until all cells of thepage have been program-verified.

Examples of Memory State Partitioning

FIG. 7(1) illustrates the threshold voltage distributions of an example4-state memory array with an erased state as a ground state “Gr” andprogressively more programmed memory states “A”, “B” and “C”. Duringread, the four states are demarcated by three demarcation breakpoints,D_(A)-D_(C).

FIG. 7(2) illustrates a preferred, 2-bit LM coding to represent the fourpossible memory states shown in FIG. 7(1). Each of the memory states(viz., “Gr”, “A”, “B” and “C”) is represented by a pair of “upper,lower” code bits, namely “11”, “01”, “00” and “10” respectively. The“LM” code has been disclosed in U.S. Pat. No. 6,657,891 and isadvantageous in reducing the field-effect coupling between adjacentfloating gates by avoiding program operations that require a largechange in charges. The coding is designed such that the 2 code bits,“lower” and “upper” bits, may be programmed and read separately. Whenprogramming the lower bit, the threshold level of the cell eitherremains in the “erased” region or is moved to a “lower middle” region ofthe threshold window. When programming the upper bit, the thresholdlevel of a cell in either of these two regions is further advanced to aslightly higher level in a “lower intermediate” region of the thresholdwindow.

FIG. 8(1) illustrates the threshold voltage distributions of an example8-state memory array. The possible threshold voltages of each memorycell spans a threshold window which is partitioned into eight regions todemarcate eight possible memory states, “Gr”, “A”, “B”, “C”, “D”, “E”,“F” and “G”. “Gr” is a ground state, which is an erased state within atightened distribution and “A”-“G” are seven progressively programmedstates. During read, the eight states are demarcated by sevendemarcation breakpoints, D_(A)- D_(G).

FIG. 8(2) illustrates a preferred, 3-bit LM coding to represent theeight possible memory states shown in FIG. 8(1). Each of the eightmemory states is represented by a triplet of “upper, middle, lower”bits, namely “111”, “011”, “001”, “101”, “100”, “000”, “010” and “110”respectively. The coding is designed such that the 3 code bits, “lower”,“middle” and “upper” bits, may be programmed and read separately. Thus,the first round, lower page programming has a cell remain in the“erased” or “Gr” state if the lower bit is “1” or programmed to a “lowerintermediate” state if the lower bit is “0”. Basically, the “Gr” or“ground” state is the “erased” state with a tightened distribution byhaving the deeply erased states programmed to within a narrow range ofthreshold values. The “lower intermediate” states may have a broaddistribution of threshold voltages that straddling between memory states“B” and “D”. During programming, the “lower inteimediate” state can beverified relative to a coarse breakpoint threshold level such as D_(B).When programming the middle bit, the threshold level of a cell willstart from one of the two regions resulted from the lower pageprogramming and move to one of four possible regions. When programmingthe upper bit, the threshold level of a cell will start from one of thefour possible regions resulted from the middle page programming and moveto one of eight possible memory states.

Sensing Circuits and Techniques

FIG. 9 illustrates the Read/Write Circuits 270A and 270B, shown in FIG.1, containing a bank of p sense modules across an array of memory cells.The entire bank of p sense modules 480 operating in parallel allows ablock (or page) of p cells 10 along a row to be read or programmed inparallel. Essentially, sense module 1 will sense a current I₁ in cell 1,sense module 2 will sense a current I₂ in cell 2, . . . , sense module pwill sense a current I_(p) in cell p, etc. The total cell currenti_(TOT) for the page flowing out of the source line 34 into an aggregatenode CLSRC and from there to ground will be a summation of all thecurrents in the p cells. In conventional memory architecture, a row ofmemory cells with a common word line forms two or more pages, where thememory cells in a page are read and programmed in parallel. In the caseof a row with two pages, one page is accessed by even bit lines and theother page is accessed by odd bit lines. A page of sensing circuits iscoupled to either the even bit lines or to the odd bit lines at any onetime. In that case, page multiplexers 250A and 250B are provided tomultiplex the read/write circuits 270A and 270B respectively to theindividual pages.

In currently produced chips based on 56nm technology p>64000 and in the43 nm 32 Gbit×4 chip p>150000. In the preferred embodiment, the block isa run of the entire row of cells. This is the so-called “all bit-line”architecture in which the page is constituted from a row of contiguousmemory cells coupled respectively to contiguous bit lines. In anotherembodiment, the block is a subset of cells in the row. For example, thesubset of cells could be one half of the entire row or one quarter ofthe entire row. The subset of cells could be a run of contiguous cellsor one every other cell, or one every predeteunined number of cells.Each sense module is coupled to a memory cell via a bit line andincludes a sense amplifier for sensing the conduction current of amemory cell. In general, if the Read/Write Circuits are distributed onopposite sides of the memory array the bank of p sense modules will bedistributed between the two sets of Read/Write Circuits 270A and 270B.

FIG. 10 illustrates schematically a preferred organization of the sensemodules shown in FIG. 9. The read/write circuits 270A and 270Bcontaining p sense modules are grouped into a bank of read/write stacks400.

FIG. 11 illustrates in more detail the read/write stacks shown in FIG.10. Each read/write stack 400 operates on a group of k bit lines inparallel. If a page has p=r*k bit lines, there will be r read/writestacks, 400-1, . . . , 400-r. Essentially, the architecture is such thateach stack of k sense modules is serviced by a common processor 500 inorder to save space. The common processor 500 computes updated data tobe stored in the latches located at the sense modules 480 and at thedata latches 430 based on the current values in those latches and oncontrols from the state machine 112. Detailed description of the commonprocessor has been disclosed in U.S. Patent Application PublicationNumber: US-20060140007-A1 on Jun. 29, 2006, the entire disclosure ofwhich is incorporated herein by reference.

The entire bank of partitioned read/write stacks 400 operating inparallel allows a block (or page) of p cells along a row to be read orprogrammed in parallel. Thus, there will be p read/write modules for theentire row of cells. As each stack is serving k memory cells, the totalnumber of read/write stacks in the bank is therefore given by r=p/k. Forexample, if r is the number of stacks in the bank, then p=r*k. Oneexample memory array may have p=150000, k=8, and therefore r =18750.

Each read/write stack, such as 400-1, essentially contains a stack ofsense modules 480-1 to 480-k servicing a segment of k memory cells inparallel. The page controller 410 provides control and timing signals tothe read/write circuit 370 via lines 411. The page controller is itselfdependent on the memory controller 310 via lines 311. Communicationamong each read/write stack 400 is effected by an interconnecting stackbus 431 and controlled by the page controller 410. Control lines 411provide control and clock signals from the page controller 410 to thecomponents of the read/write stacks 400-1.

In the preferred arrangement, the stack bus is partitioned into a SABus422 for communication between the common processor 500 and the stack ofsense modules 480, and a DBus 423 for communication between theprocessor and the stack of data latches 430.

The stack of data latches 430 comprises of data latches 430-1 to 430-k,one for each memory cell associated with the stack The I/O module 440enables the data latches to exchange data with the external via an I/Obus 231.

The common processor also includes an output 507 for output of a statussignal indicating a status of the memory operation, such as an errorcondition. The status signal is used to drive the gate of ann-transistor 550 that is tied to a FLAG BUS 509 in a Wired-Orconfiguration. The FLAG BUS is preferably pre-charged by the controller310 and will be pulled down when a status signal is asserted by any ofthe read/write stacks.

With respect to the sense modules 480, a number of arrangements arepossible, with the next section presenting one particular set ofembodiments in detail. In addition, various embodiments for sensemodules that can be profitably incorporated into the arrangements givenabove are developed in U.S. Pat. Nos. 7,593,265 and 7,957,197. Referenceis also made to U.S. Pat. No. 7,046,568, which discloses a non-volatilememory device with low noise sensing circuits capable of operating at alow supply voltage; U.S. Pat. No. 7,173,854, which discloses a method ofreferencing the word line voltage close to the source of each memorycell in a page so as to alleviate the problem of source bias error dueto the ground loop; and U.S. Pat. No. 7,447,079, which discloses amemory device and method for regulating the source of each memory cellalong a page to a predetermined page source voltage.

Compact Sense Amplifiers

This section considers a particular arrangement for the sense modules480-i for use in the read/write circuitry presented in the precedingsections. FIG. 12 is a representation of such a compact and versatilesense amp, with various aspects of its operation being illustrated withrespect to FIGS. 13-18. As will be discussed, among its other featuresthis sense amp arrangement provides a way to pre-charge bit lines whiledoing data scanning. Another feature is that the sense amp circuit canprovide a way to set three different bit line levels used in the quickpass write (QPW) technique using dynamic latch, where quick pass writeis a technique where cells along a given word line selected forprogramming can be enabled, inhibited, or partially inhibited forprogramming. (For more discussion of the quick pass write concept, seeU.S. Pat. No. 7,345,928.) Also, it can provide a convenient way tomeasure the cell current.

Considering FIG. 12 in more detail, this shows the sense amp circuitwhich can be connected to a bit line at BL (lower left, below the bitline selection switch BLS 623) and to bus at SBUS (to the left of SEL609 below the element FLAG 601). The input signal CLK is received (lowermiddle) can be supplied at the lower plate of the capacitor CSA 631. Thesense amplifier is then also connected to a high voltage supply level(VDDSA) and ground.

A latch circuit FLAG 601 has a first leg with node FLG and a second legwith node INV, where these legs each have their node cross-coupled tothe gates of a pair of series connected transistors in the other leg.The first and second legs also each include a switch formed by a PMOSrespectively controlled by STF for 603 and FRB for 605, whereby each thelegs can by such off above the node. The level on the node FLG can thenbe connected to the bus at SBUS through the switch 609 with controlsignal SEL. The latch can be reset through the signal RST at 607, allowINV to be set to ground.

The bit line BL can be selectively connected to the node COM by use ofthe bit line selection switch BLS 623 and bit line clamp BLC 621. Thenode COM can also be directly connected to the high supply level by theswitch BLX 625. In between the bit line selection circuitry and thelatch FLAG 601 is the intermediate circuitry of the sense amp. Inaddition to the node COM is a node MUX that is connectable to the COMnode by use of the switch BLY 627. The node MUX can also be connected tothe high supply level by use of the PMOS 615 controller by FLA,dependent upon the level on FLG as this is connected to the gate of PMOS613 connected in series with FLA 615 between MUX and VDDSA.

The internal node SEN can be connected to, or isolated from, the MUXnode by the H00 device 639 and the COM node by the XX0 device 633. Thetop place of the capacitor CSA 631 is also connected to the internal SENnode of the sense amp. In addition to being connected to the bottomplate of CSA 631, the CLK signal is also connected to the MUX node byway of the transistor 635, whose gate is connected to the SEN node,connected in series with the independently controllable device STRO 637.The switch FCO 611 allows the node MUX to be connected to, or isolatedfrom, the level on the FLG node of the latch 601.

The arrangement of the elements in FIG. 12 has a number of usefulproperties, including the ability to pre-charge a bit line concurrentlywith a data transfer. During the bit line pre-charge through the bitline select switches, the MUX node needs to stay at the power supplyvoltage level. During data scanning (or called data transfer), the datainformation from the FLG node needs to be sent to the SBUS node.Consequently, the SBUS node toggles. Because the device FCO 611 canisolate the MUX node from the SBUS node, the MUX node will not bedisturbed during the data transfer. In this way, a bit line can bepre-charged at the same time as data transfer happens. Hence, the memorycircuit's performance can be improved by being able to do both of theseoperations at the same time.

Another useful property of the arrangement of FIG. 12 is that it allowsfor the three bit line values (allow, inhibit, partial inhibit) of thequick pass write (QPW) to be forced using a dynamic latch arrangement.During such a “3 BL QPW”, for the inhibited bit line case the MUX nodecan be firmly held at a VDDSA level during the scanning in operation.The arrangement of switches prevents the SEN node from leaking, so theSEN node's voltage can be maintained. The CLK node supplies voltage tothe inhibited bit lines. The SEN node of the inhibited bit linepre-charges to a high level, passing the CLK voltage level to theinhibited BL through the CLK-SEN-STRO-BLY-BLC-BLS path.

The arrangement of FIG. 12 also allows for an easy cell currentmeasurement. A selected bit line's voltage can be supplied from anexternal pad of the chip through theSBUS-SEL-FLG-FCO-MUX-BLY-COM-BLC-BLS path. This means that thecorresponding FLG node needs to be at an analog voltage level: besidesturning off the STF device, the INV node is pulled down to the ground bythe RST device. Biasing the RST node at a logic high level can pull andhold the INV node at ground.

Some of the different modes of operation for the sense amp of FIG. 12are discussed with reference to FIGS. 13-18.

No-Lockout Read/Program Verify Operation

As a first example, a no-lockout read or program verify mode ofoperation is illustrated with respect to FIG. 13. Initially, asillustrated by (1), the H00 639 and FCO 611 devices are turned on,discharging the SEN node through the FLG node as FLG is initially atground. The bit line then pre-charges through the BLS 623-BLC 621-BLX625 devices, as shown by (2). Next, as shown at (3), the SEN nodepre-charges to a certain voltage level through the H00 639 and FLA 615devices. The level at CLK then raises up and the XX0 633 device is thenturned on, (4). The SEN node will then develop: If the memory cell isconductive, the SEN node will be discharged; otherwise the SEN node willnot discharge much.

After the SEN node develops, the device pulls down the CLK node toground. Next, the FRB 605 device is turned off, and the RST 607 deviceis turned on (at (6)) to reset the FLG node to the high VDDSA voltagelevel. Then the FRB 605 device is turned on and the RST 607 device isturned off. Subsequently, the FCO 611 device is turned on to pre-chargethe MUX node from the FLG node. Next, the memory turns off the STF 603device, then turns on the STRO 637 device to develop the FLG node, asshown at (8). The level previously at SEN will determine whether device635 is on or not, in turn determining what level will develop on the FLGnode. Once the FLG node finishes developing, the STRO 637 device isturned off.

Thus, the state of the selected cell along the bit line BL is used toset the value on the node SEN, from where it is transferred to the FLGnode. At this point, the SEL 609 device can be turned on to transfer thevalue of FLG out to SBUS. Note also that once the result has beentransferred from the SEN node on to the FLG node, the device FCO 611 canbe used to isolate the rest of the sense amp circuitry from the latch601, while still allowing the value latched on the FLG node to betransferred out through SEL 609. Consequently, the data latched on theFLG node can be scanned at the same time that the sense amp moves on toa next process if this does require the latch FLAG 601.

Lockout Read/Program Verify Operation

A second mode of operation is a lockout read/program verify mode.Although a somewhat more involved process than the more commonno-lockout read, lockout read will draw less current as once a cellgenerates a positive read result (FLG high), it is removed from furthersensing. Note that this is a lockout from further sensing in a series ofsensing operations, as opposed to a programming lockout. For example, inmulti-state memory a sensing operation, whether for a data read,program-verify, or other reason, will often include a series of senseoperation. Putting this in the context of the exemplary embodiment, aseries of sense operations will include checking the memory cell's stateagainst a number of reference parameter by, in this example, pre-chargethe cell's bit line, applying a sensing voltage to the word line, andseeing if the bit line discharges through the cell. This is done for aseries of increasing sensing voltages corresponding to differing states.However, if a cell conducts enough to discharge the bit line at, say,the second sensing voltage, repeating the process again at a third,higher sensing voltage will supply no additional information, but onlyserve to waste the current used for it and any subsequent sensings;hence, the read lockout.

During a first read cycle, the operation for the lockout is similar tono-lockout operation just discussed with respect to FIG. 13, except thatwhere in FIG. 13 at (2) the bit line is pre-charged through theBLS-BLC-BLX path, now the BL pre-charges through the BLS-BLC-BLY-FLApath. Consequently, at the end of the first read pass, the level on theFLG node will either be VDDSA or ground, as described in the lastsection. The process for the second and subsequent read cycles isillustrated with respect to FIG. 14.The second pass (and any subsequentpasses) again begins with the H00 639 and FCO 611 devices turning on topass FLG's voltage (either VDDSA or ground) to the SEN node, as shown at(1).

The set of sub-processes are marked as (2), where, if negative sensingis being performed, CLK will pre-charge to a certain level (for example,this could be 0.6V to 1.7V in a practical implementation), while ifpositive sensing, CLK will stay at ground. At the same time, the BLY 627and STRO 637 devices turn on to pre-charge the bit line (BL). Scanningdata from FLG to external data latch (such as 430-i, FIG. 11) can happenat the same time (also shown as (2)). This is due to the switch FCO 611that can isolate the FLG node from the MUX node. Note that the level onthe node FLG controls the device 613 above FLA 615. If the original FLGdata is low, the bit line recovers through the BLS-BLC-BLY-FLA path as613 is on. Otherwise, the bit line is held at the CLK level through theSEN-STRO-BLY-BLC-BLS path.

After the BL recovers, the STRO 637 device is turned off, then CLK ispulled down to the ground. At (4), the SEN node is pre-charged throughthe H00 639-FLA 615 devices. The CLK level then raises up, after whichthe XX0 633 device is turned on. The SEN node will develop, as shown at(5). If the memory cell is conductive, the SEN node will be discharged;otherwise the SEN node will not discharge much. After the SEN nodedevelops, the memory turns off the XX0 633 device, then pulls down theCLK node to ground. The BLY 627 device is turned off. The FRB 605 deviceis then turned off and the RST 607 device turned on to reset the FLGnode to VDDSA voltage level, (8). The FRB 605 device can then be turnedon and the RST 607 device turned off. The MUX node is then pre-chargedfrom the FLG node by turning on the FCO 611 device. As shown at (10),the STF 603 device is turned off, then the STRO 637 device is turned onto develop the FLG node based on the value at SEN, which is connected tothe control gate of 635. After that, memory turns off the STRO 637device, and then turns on the STF 603 device. Once the FLG level isdeveloped, it can then be scanned out to SBUS through SEL 609.

Quick Pass Write, Two Forced Bit Line Values

During a program operation, for cells to be programmed the bit line isbiased to a low voltage (typically ground), while cells that are not tobe programmed or have verified and need to be locked out from furtherprogramming have their bit line biased high. In a quick pass write (QPW)arrangement, cells that are selected for programming that areapproaching their target level are partially inhibited to slow theprogramming process for better accuracy by raising their bit line levelsto an intermediate value. These bit lines values can be set in severalways. In this section the case where two of these bit line values areforced (or “2BL forcing”), both the program enable value (0V) and theQPW partial inhibit value (˜0.7V) are forced, while for the programinhibit case the bit line is left to float after being initially sethigh. An alternate arrangement where the high, program inhibit value isalso forced (or “3BL forcing”) is considered in the next section.

Considering the process as shown in FIG. 15, at (1) data is set on theFLG node from the SBUS by way of the SEL device 609. If the bit line isinhibited, the corresponding FLG=VDDSA is set, while otherwiseFLG=ground. The level on the FLG node then is used to set bit line valuein (2): the BLS 623, BLC 621, BLY 627 and the FCO 611 node are raised toa high voltage. Then the bit line will either pre-charge to the VDDSAlevel or stay at ground, depending on its FLG data. At (3), the BLC621/BLY 627/FCO 611 devices are turned off and the data is set again onthe FLG node. If the bit line is inhibited/QPW, the corresponding FLGvalue is VDDSA, otherwise FLG=ground.

At (4), the BLC 621/BLY 627 nodes are then raised again to a highvoltage. The memory will raise the FCO 611 device's gate node to avoltage level that will be used to control the QPW BL's voltage level,say ˜0.7V (for a VDDSA of ˜2.5V), for example, to set a level of ˜0.7Von BL. The inhibited bit line will float at a level high enough toinhibit the programming. The QPW BL is pre-charged through theFCO-BLY-BLC-BLS path, the programmed BL shares the same path but biasedat ground by its FLG node. Once the bit line becomes stable at theappropriate level, programming can be done.

Quick Pass Write Three Forced Bit Line Values

As just noted, for the “2BL-forcing” arrangement, the inhibited bit linewill float. This section considers a mode where the inhibit value isalso forced to the high supply level in a “3BL-forcing” arrangement,allowing all three values to be set by the single latch. The process, asillustrated with respect to FIG. 16, again begins with setting data onthe FLG node from the SBUS by way of SEL 609, as shown at (1). If thebit line is inhibited, the corresponding FLG=VDDSA, otherwiseFLG=ground. At (2), the H00 639 and FCO 611 gate nodes are raised to ahigh voltage, passing FLG's voltage level to the SEN node. The H00 639device is then turned off.

Next, as shown by the paths (3), the memory raises the BLS 623, BLC 621and BLY 627 gate nodes to a high voltage. The FCO 611 gate node is stillkept at a high level from the previous sub-operation, (2). Based onthese levels, the BL node will either pre-charge to the high VDDSA levelor stay at ground, depending on its FLG data. The H00 639 gate node isbiased at a threshold voltage that will keep the H00 639 device weaklyon for the 13L that is not to be inhibited; for an inhibited BL, the H00639 device is still off as the MUX node is at VDDSA level. At the sametime CLK node is charged to the VDDSA level. The inhibited BL is thenpre-charged through the FCO-BLY-BLC-BLS path. The other BLs also sharethis path, but held at ground by the FLG node.

After some time, the STRO 637 device is turned on. The SEN node willstill be a high level for the inhibited bit line, while it is at groundfor the other cases. Consequently, the device 635 will also be on forthe inhibited case. Consequently, as shown by the path (4), for aninhibited BL, its MUX node is hold firmly at VDDSA by the high CLKvalue. Consequently, the internal node SEN is again being used as aninternal dynamic latch where a voltage level can be parked.

The BLC 621/BLY 627/FCO 611 devices are then turned off and the memoryagain sets data on the FLG node, as shown at (5). If BL isinhibited/QPW, the corresponding FLG=VDDSA, otherwise FLG=ground. TheBLC 621/BLY 627 nodes are then raised to a high voltage again. For theFCO 611 device's node, this is raised to a voltage level that will beused to control the QPW BL's voltage level. The inhibited BL is held atVDDSA level through the CLK-SEN-STRO-BLY-BLC-BLS level. The QPW BL ispre-charged through the FCO-BLY-BLC-BLS path. Both paths are marked (6).The programmed BL shares the same path, but is biased at ground by itsFLG node. After the bit line is allowed stabilize, the correspondingselected word line can be programmed.

Floating Quick Pass Write

The mode discussed in this section is another variation on the quickpass write technique, a Floating Quick Pass Write (FQPW) or Nakamuraoperation. The ability to perform this operation for a given bit linewith only a single latch using the sense amp circuit of FIG. 12 will bediscussed with respect to FIG. 17. In this variation on quick pass writeimplementation, the bit lines are again of three groups: a first groupto be inhibited; a second group to be programmed; and the third group tobe slow programmed. In a first step, the first group is taken to a valueoffset some below the high level, VDDSA-ΔV, where the offset can be asettable parameter. For example, if VDDSA=˜2.5V and ΔV is ˜0.7V, thiswould be ˜1.8V. The second group pre-charged, then left to float at 0V.The third group is set to a low value, say ˜0.7V, then left to float. Atthe second step (see (6) below), the first group is taken to the highlevel, while the bit lines of groups 2 and 3 will get coupled up if thebit line is adjacent to a bit line of group 1.

Referring now to FIG. 17, as shown at (1), the memory sets data on theFLG node: If BL is inhibited, the corresponding FLG value is VDDSA,otherwise FLG=ground. Next, as shown at (2), the H00 639 and FCO 611nodes are raised to a high voltage to pass the FLG's voltage level tothe SEN node. The BLS 623 device could turn on at this time. At (3), theH00 639 node's voltage is lowered to make it barely above a thresholdvoltage to keep H00 639 NMOS weakly on for the BL that is not inhibited;for an inhibited BL, the H00 639 device is still off since the MUX nodeis at VDDSA level. The CLK node is raised to a level lower than VDDSA bya certain amount (VDDSA-DELTA), corresponding to the group 1, step 1described in the last paragraph. After some time, the memory turns offthe H00 639 and FCO 611 devices completely. Note that at this point thatthe SEN node for inhibited BL case will be at a very high level, whileit is at ground for other BL cases.

Once the BLC 621/BLY 627/FCO 611 devices are off, the memory again setsdata on the FLG node at (4). If the BL is inhibited/QPW, thecorresponding FLG level is VDDSA, otherwise FLG=ground. Once data isagain set, the BLC 621/BLY 627/STRO 637 nodes are raised to a highvoltage. The memory will raise the FCO 615 device's gate node to avoltage level that will be used to control the QPW BL's voltage level.The inhibited BL is charged through the CLK-SEN-STRO-BLY-BLC-BLS path,while, the QPW BL is pre-charged through the FCO-BLY-BLC-BLS path, andthe programmed BL shares the same path but biased at ground by its FLGnode. These paths are shown at (5). After some time, the memory turnsoff the FCO 611 device and raises CLK to the VDDSA level, as shown at(6). After the bit line stabilizes, the corresponding word line can beprogrammed.

Measurement of Cell Current using External Bias Voltage

A final example is a mode allowing a cell's current to be measured usingan external bias voltage. This is illustrated with respect to FIG. 18.Referring back to FIG. 12 first, the arrangement of the FLAG resetswitch RST 607 allows for the INV node to be held to ground. By placingan external voltage onto a pad of the memory chip that can be connectedto the SBUS node, this allows the amount of current drawn by the bitline to be measured. This can be used, for example, as part of a testmode to analyze device characteristics. When measuring the cell current,for example, half of the bit lines could be selected, the other halfunselected. (In the following, it is assumed that the unselected BLs arealso biased.)

For the selected BLs, the RST 607 device is always on to pull its INVnode to the ground as shown at (1), its STF 603/FLA 615 devices are off,and its FCO 611 device is on. For an unselected BL, its RST 607 deviceis off, while its STF 603/FLA 615 device is on, its FCO 611 device isoff. For the unselected BL, its FLG node is initialized to be at ground.Note that at this point that the FLG node of the selected BL is notcontrolled by the FLAG latch anymore, which is now floating at thispoint.

Next, the SEL/BLY 627/BLC 621/BLS 623 devices are turned on. BLY 627 andBLS 623 are at a high voltage. For a selected BL, its BLC 621 node is ata very high voltage to pass the bias voltage from the external pin tothe BL through the SBUS-SEL-FCO-BLY-BLC-BLS path. For the unselected BL,its BLC 621 node is biased at a level to control the BL's voltage, theunselected BL are pre-charged through the FLA-BLY-BLC-BLS path. Theseare both shown at (2). The amount of current being drawn can then bemeasured.

Second Embodiment

The preceding discussion, which are developed further in U.S. patentapplication Nos. 13/277,915 and 13/277,966, was based on the circuit ofFIG. 12. This section presents another embodiment, shown FIG. 19, whicha variation of the circuit of FIG. 12 and is numbered similarly (i.e.,701 of FIG. 19 corresponds to 601 of FIG. 12).

Relative to the circuit of FIG. 12, the embodiment of FIG. 19 nowincludes the transistor ICO 791 connected between the INV node of thelatch 701 and the MUX node. This can remove the need for a dynamic latchoperation. INV has the polarity of the program or verify operation and,as discussed below, during binary (SLC) programming, ICO 791 can passthe VDDSA level to the bit line BL for erase or verify pass of aselected cell along the bit line and pass VSS to BL for programming ofthe cell.

The circuit of FIG. 19 also differs from that of FIG. 12 in that thedrain side of H00 739 is now connected to high voltage level rather thanMUX, so that the SEN precharge is no longer related to FLG. As will bediscussed below, this allows the XX0 733 assist to be done at the sametime SEN is precharged. In addition, FLG can be reset to VDDSA inparallel with SEN precharging. With these modification, the programmingtime for both SLC and MLC operations can be improved by severel percent.

The operation of the circuit of FIG. 19 during a binary, or SLC, programoperation can be illustrated with respect to FIGS. 20A and 20B, whereFIG. 20A shows the operation for the circuit of FIG. 12 and FIG. 20Bshows the same case for FIG. 19. Considering the case of a “Fast SLC”program operation, the FLG value in 601 or 701 is not reset bytransferring data between it and the latches outside of the sense amp inorder to thereby speed up programming and the sense amp can keep theoriginal program data with adding additional latch. Before the firstprogram pulse, the memory transfers in the data into the sense amp. Asdiscussed in more detail above, for a program cell FLG=VDD and for theerase cell FLG=VDD. During verify, FLG is flip from VDD to VSS thoughthe STRO 637 and SEN 635 path (CLK=VSS during strobe) for the cell thatpasses verify.

For the circuit of FIG. 12, in order to inhibit the cell fromprogramming, the circuit biases the bit line BL to the VDD level forpreparing boosting at the later stages. Since FLG=VSS for the erase cellor verify pass cell, the FLA 615 path is used to pass VDDSA. When thememory is to program the cell, it will keep the BL to VSS level. Inorder to do so, the FLG level is first transferred to the SEN node(SEN=VDD for the program cell, and VSS for the erase or verify passcell), and then the STRO 637 path is used to discharge BL to VSS(CLK=VSS). FIG. 20A is the timing diagram of this operation: initallyH00 and FCO are taken high to pass the value to set the value on SEN,after which the STRO value goes high. (See discussion in preceedingsections for more detail.)

To speed up this operation, the embodiment of the circuit of FIG. 19adds the transistor ICO 791 to provide a path between the INV node ofthe latch circuit FLAG 701 and the MUX node. This allows the memory toskip the “dynamic latch” operation (that is, remove FLG to SEN transfer)of the circuit of FIG. 12 at the beginning of FIG. 20A, thereby speedingup the operation. The corresponding operation of FIG. 19 is illustratedby the waveforms of FIG. 20B. In contrast to FIG. 20A, in FIG. 20B boththe H00 and FCO levels stay low, thereby saving the time to raise thesevalues, perform the FLG to SEN trnasfer, and take these vlaues backdown. In FIG. 20B, the STRO value also can stay low as the SEN value nolonger needs to be sifted to the 13L, as this can be done by the ICOpath.

The design of FIG. 19 also allows for the verify operation to beimproved with respect to that of FIG. 12. FIGS. 21A and 21B respectivelyillustrate waveforms for an SLC program verify operation using thecircuits of FIGS. 12 and 19.

Considering FIG. 21A first, this shows the waveforms for the SLC programverify for the circuit FIG. 12, as is described further in thecorresponding section above. This process has the SEN node pre-chargedbased upon the FLG value, for which H00 639 and FCO 611 are high. DuringSEN precharge, the memory has to disable XX0 633 to avoid a direct pathcurrent from VDD (BLX 625)→COM (XX0 633)→SEN (H00 639)→MUX (FLG=VSS).Consequently, as shown in FIG. 21A, the XX0 vlaue needs to stay lowwhile H00 is high, without any overlap. (In FIGS. 21A and 21B, theintermediate value of XX0 before it goes fully high is an optionalvariation, as opposed to keeping it low until taken to its full value.)

In FIG. 19, as H00 739 is now connected between SEN and VDD, rather thanMUX, the SEN node can be directly connected to VDD as needed; and, asthis cuts of the path between MUX and COM by H00 739 and XX0 733, thehigh. H00 and XX0 values can overlap. This is shown in FIG. 21B, wherethe hump in the H00 no longer needs to be back to the low voltage levelbefore XX0 can be raised, thereby speeding up operation. The combinedchanges of FIGS. 20B and 21B can allow an increase of around 5-10percent in SLC programming performance.

The change in the connection of H00 739 in FIG. 19 relative to H00 639in FIG. 12 can be used to improve multi-state programing performance.FIGS. 22A and 22B respectively show a MLC program verify operation forthe circuits of FIGS. 12 and 19. In the MLC program verify operationusing FIG. 12, as illustrated in FIG. 22A (and as described in moredetail above), the FLA 615 path is used to pre-charge SEN node. Thememory also will need to reset FLG from VSS to VDD before strobing,which cannot be done in FIG. 12 until the pre-charge of the SEN is done.(In FIGS. 22A and 228 the four bumps of the RST signal correspond to thefour states in a 2-bit per cell embodiment.)

In the embodiment of FIG. 19, H0O 739 can be used to connect the SENnode to VDDSA. This allows the SEN node to be pre-charged without usingthe FLA 715 path, which in turn allows the SEN pre-charge to be doneconcurrently with resetting the FLG level. In FIG. 22B, the XX0, H00,and CLK lines are much as before to pre-charge SEN. However, now the RSTwaveform to reset FLG can now be moved forward to overlap withpre-charge of SEN, along with FRB, the rising of FLA, and the rising FCOall being shifted earlier. This allows the MLC verify to be sped up,improving MLC programming performance by several percent.

Third Embodiment

FIG. 23 is a third embodiment of a sense amplified circuit that canachieve faster access time with smaller layout area and less powerconsumption. In FIG. 23, corresponding elements are numbered similarlyto the embodiments of FIGS. 12 and 19, (i.e., FCO is 611, 711 and 811 inthe corresponding figures). Relative to the embodiment of FIG. 19, theelements of FLAG 801, ICO 891, and FCO 811 are reversed.

A first difference relates to the power supply connections of the senseamp. To reduce power consumption, the embodiment of this sectionseparates the power supply level of the latch 801 and the power supplylevel used to drive the bit lines. When in read and verify mode, thesense uses one supply level, such as of 2.5V or more, to drive a bitline. However, there is usually no reason to use such high voltage fordata storage and program inhibit operations. For example, a typicalprogram inhibit level requirement for a bit line BL is lower, around1.8V to 2.0V. In this embodiment, another separate power supply is usedfor this function, resulting in lower charge consumption during program.Also, because the latch supply level is being reduced, the averagecurrent drawn for scan operation and strobe operation is reduced.

This is shown in FIG. 23, where these function are split by having thesense amp connected to the two different supply levels. The level VDD,which is used for the program inhibit, is used at the supply level atthe latch FLAG 801, rather than the VDDSA level as in FIG. 19. Outsideto the latch, the higher VDDSA level is still connected through BLX 825and HLL 839 for use in pre-charging bit lines for sense operations. Inthe exemplary embodiment, VDD is available on the chip as it is used byvarious peripheral circuit elements and the VDDSA level can also beprovided as a regulated level with use of a charge pump.

Another difference from the embodiment of FIG. 19 is that FIG. 23 lacksthe elements corresponding to the PMOS transistors 713 and FLA 715, sothat the circuit requires less area. The program inhibit level bit linelevel can be pre charged directly from the INV latch level. As shown inFIG. 23, the only PMOS transistors are RSTF803 and RSTI 805 of the latchcircuit, which are connected only to VDD. As VDD is the common supplylevel used in the other peripheral logic circuitry on the device, thereis no longer a need for two separate NWELLs, one for VDD and one forVDDSA. This can help to minimize the layout area in peripheral circuitryand local buffer area. Use of the lower VDD level in the latch can alsohelp reduce the amount current, and therefore power consumed, in theoperation of the FLAG latch 801.

Another difference between FIGS. 19 and 23 is that the reset transistorRST 707 is missing in FIG. 23, which has instead added the switch MTG893. MTG 893 can discharge the latch node INV (through ICO 891) and alsodischarge the latch node FLG (though FCO 811). The MTG transistor 893can also discharge bit line without using latch information. Thisarrangement allows the circuit to perform operations in parallel,discharging the bit line at the same time as transfers the sensingresult out to other latches.

The embodiment of FIG. 23 further differs from the preceding embodimentin the connected in its “strobing” path, the path through STRO 837 fromthe CLK node to the FLG node of the latch 801. In FIG. 23 this path isconnected to the latch node FLG directly at the node marked X above FCO811, instead of being connected at the MUX node as in FIG. 19. Thisdirect connection removes the need for a pre-charge sequence of the MUXnode in order to avoid charge sharing during strobe, when the analogvoltage on the SEN node is converted to a digital value (VDDSA orground/VSS) at FLG.

Fourth Embodiment: Hybrid Lockout

This section presents a fourth set of sense amp embodiments that addseveral features to the embodiments of the preceding section. Inparticular, it includes some modifications adapted to the implementationof a quick pass write (QPW) functionality and a “hybrid” lockout sensingoperation, which is described below. FIG. 24 shows an exemplary circuitfor the fourth embodiment. In FIG. 24, corresponding elements are againnumbered similarly to the embodiments of FIGS. 12, 19, and 23, (i.e.,FCO is 611, 711, 811, and 911 in the corresponding figures).

Much of FIG. 24 is much as shown in FIG. 23 and will function asdescribed in the preceding section. A first difference of FIG. 24relative to FIG. 23 that although the nodes COM and SEN can both beconnected to VDDSA through respective switches BLX 925 and HLL 939, thisis now done by way of switch 941 whose gate is connected to FLG. Aswitch 943, also controller by FLG, is also now connected between COMand SEN in series with XX0 933. When FLG is high, SEN and COM can bothbe connected to VDDSA and to each other, but when FLG is low, thesepaths are now cut off. When FLG is low, INV is high and a newly addedpath to COM is available: COM is now connected to a node to receive thelevel SRCGND by way switch GRS 947 in parallel with switch 945, whosegate is connected to INV.

FIG. 24 now also includes a second latch circuit LATCH 951 connected tothe SEN node. LATCH is also connected to the bus SBUS through switch 961controlled by SEL. To be able to selectively connect either FLAG 901 orLATCH 951 to SBUS, a switch SCNL 963 is connected in series with 961 anda switch SCNF 965 is added in parallel with 909. LATCH 951 can either beimplemented as a static or dynamic latch, where a dynamic latch is shownin this example. The node LAT is connected to ground by way of a switchSTRL 955 in series with switch 953, whose gate is connected to the nodeSEN. LAT can also be connected to VDDSA by switch PCHL 957 and acapacitor CLAT 952 is connected between LAT and ground to hold chargefor the node LAT. The level on LAT is then connected to the control gatefor the switch 959.

As with the embodiment of FIG. 23, two different supply levels are againused in the sense amp circuit of FIG. 24, where most of the circuitagain using VDDSA, while the latch FLG 901 uses a different level. Inthe embodiment of this section, as shown in FIG. 24, though, the latchFLG uses a higher voltage level that can be supplied from a charge pump.In FIG. 24, this voltage level in labeled as VFOUR as this example usesa value of 4V, although other values can be used. In this example, theVDDSA level can be taken as −2.5V and the level at the SRCGND node canrange up 1V-1.7V or even 2V. For these values, the use of a higher (here4V) power supply for FLAG 801 also for the circuit to easily to pass the(e.g. 2V) SRCGND or the (e.g. 2.5V) VDDSA to the BL node while onlyusing NMOS devices. Here, when FLG is high (and INV low), the switches841 and 843 allow the VDDSA level to pass due to the higher VFOUR valueat their gate, while when INV is high (and FLG low) the switch allows awide range of SRCGND values to pass. In this arrangement, VDDSA is thelevel used for both to pre-charge for sensing and for program inhibit.For quick pass write (QPW), the supply is also from VDDSA, but the levelis clamped by the BLC 921 gate voltage. SRCGND is the path for verifylockout or to program BL (VSS). Also, by restricting the higher VFOURvalue to just FLAG 901, it keeps the sense amp from drawing too muchcurrent as would result from also setting VDDSA at this higher value.

The use of this higher voltage (here 4V) for the FLG and INV values ofthe latch FLAG latch 901 allows the switches 841, 843 and 845 to passthe full range of VDDSA and SRCGND values used in the sense amp to bepassed on to the bit line at BL while using only NMOS devices in thepaths. This allows for the sense amp of FIG. 24 only use N-type devices,except for the latch FLAG 901, and still set all of the desired valuesfor hybrid lockout sensing operations. Without this higher VFOUR value,in order to pass a higher voltage than something like 1.2V from SRCGNDto lockout the bit line at BL, previous desings would use a CMOS passgate to pass this level to BL. And also in order to resolving theclocking noise issue, a CMOS pass gate may also be added in between SENand BL. However, the use of CMOS pass gates, where an NMOS and a PMOSare connected in parallel, has some disadvatages. One of these is thatthere can be a forbidden region of a range between where the NMOS andPMOS conduct, so that some levels will not be passed. Another is theincreased area and metal routing that PMOS devices need, particularly ifthey require a separate well level from the PMOS devices in FLAG. Byusing only NMOS devices for the analog path, the embodiment of FIG. 24can resolve the limited CMOS operation zone issue also achieve a compactlayout area.

As noted above, the addition of the second latch LATCH 951 allows thesense amp of FIG. 24 to support a “hybrid lockout” mode for sensing.Both Quick Pass Write (QPW) and lock-out sensing have been discussedabove with respect to the first set of embodiments. Hybrid lockoutsensing is a mixture of these ideas, in that the bit line is locked outafter verify at the high verify level of a state, but not locked outbetween verifying at the low and high verify levels of a state; that is,bit lines are locked out between different states, but not between thelow and high verify levels of each state. This hybrid mode can provideno-lockout verify programming performance with lockout program currentlevel.

FIG. 25A illustrates a typical set of word line waveforms for a commentimplementation of quick pass write (QPW) and FIG. 25B illustrates analternate approach that is preferable used with the sense amparrangement of this section. FIG. 25A is a schematic illustration of aQPW verify control gate waveform. Here, the memory cells can be programfrom a ground or erased state into states A, B, C, and so on. After aprogramming pulse, each cell is verified for the different levels and,if the cell verifies at its target level, it is inhibited from furtherprogramming. In a QPW program, each verify level is split in two: a lowverify and a high verify. For example, the A state is now verified forthe low level VAL and high level VAH, and similarly for the otherstates. When a cell verifies at the high level for its target state, itis fully inhibited, while if it verifies at the low level of its targetstate it is partially inhibited to slow, but not stop programming. (Formore discussion of the quick pass write concept, see U.S. Pat. No.7,345,928.)

The approach of FIG. 25B shows a slightly different approach, (In FIGS.25A and 25B the time scales are not meant to correspond, being schematicin both cases.) In a sense operation with the sense amp of FIG. 24, abit line is pre-charged, a sensing voltage placed on its control gateand then after a period of time, the amount of discharged is checked. Inthe arrangement of FIG. 25B, only one control gate voltage (VCG) is usedfor each state, so that, for example, VA in FIG. 25B would correspond tothe high level VAH of FIG. 25A and similarly for other states; but now,for each level, the discharge level is checked twice in relatively quicksuccession. These two sense operations then correspond to the low andhigh levels, hence the notation ˜VAL, ˜VAH, and so on. The bottom lineshows the strobe waveform as applied to STRO 937. This first time STROgoes high is to set the QPW latch LATCH 951 and the second time to setFLAG 901, both being done from the same discharge: When STRO is high,the circuit latches in the sensing result into the FLG node; STRL alsohas the same polarity as STRO and when it is high, the sensing resultwould be latched into node LAT. (More detail on implementing a quickpass write operation using this sort of sensing technique is given inU.S. Pat. No. 8,233,324.) Lockout sensing is only implemented by theFLG/INV value, while LAT, is to set partial programming inhibit.

Under this arrangement, when the sense amp has FLG is high (INV low) forsensing. When FLG is low (INV high), sensing is locked out and theSRCGND level can be connected to the BL node for verify lock out. Inthis way, the FLG value drives bit line and only flips when go A→B, B→C,etc., while LAT is independent of 13L driving current and can be usedfor the QPW value.

Consequently, the fourth set of embodiments described in this sectionhas a number of features that be advantageously applied in the senseamp's operation. A first of these is the use of separate power supplylevels for the primary data latch and the bit line driving path. Inparticular, the primary latch uses a higher power supply level, such ascan be provided by a charge pump. Another feature is the use of onlyNMOS devices only for the analog path, which can not only resolve thelimited operation zone issue that can arise from CMOS pass gates andalso achieve a compact layout area. Further, the addition of a latchallows the sense amp to support a “hybrid lockout” mode for balancingperformance and current consumption.

CONCLUSION

Although the various aspects of the present invention have beendescribed with respect to certain embodiments, it is understood that theinvention is entitled to protection within the full scope of theappended claims.

It is claimed:
 1. A sense amplifier for a memory circuit, comprising: anintermediate circuit, including a first node selectively connectable toone or more bit lines; bit line selection circuitry connected to thefirst node, whereby the first node can selectively be connected to oneor more bit lines; a pre-charge switch by which the first node can beconnected to a first supply level for pre-charging of the first node fora sensing operation; a first latch circuit connectable to theintermediate circuit to have a value latched therein set according thelevel on the first node; a second latch circuit connectable to theintermediate circuit to have a value latched therein set according thelevel on the first node; and first and second switches whereby the valuelatched in the first and second latch circuits can respectively betransferred to a data bus, wherein, in a sensing operation, afterpre-charging the first node and prior to a subsequent pre-charging, thefirst and second data latch circuits can sequentially be connected tohave the value latched therein set according to the level of the firstnode.
 2. The sense amplifier of claim 1, wherein the first latch circuitis connected between a second supply level and ground, and the secondlatch circuit is connected between the first supply level and ground,the second supply level being different than the first supply level. 3.The sense amplifier of claim 2, wherein the second supply level of ahigher level that the first supply level.
 4. The sense amplifier ofclaim 3, wherein the memory circuit on which the sense amplifier isformed further includes a charge pump circuit, where the second supplylevel is provided from the charge pump circuit.
 5. The sense amplifierof claim 1, wherein a node of the first latch that holds the valuelatched therein is connectable to the first node through a third switch.6. The sense amplifier of claim 6, wherein a node of the first latchthat holds the inverted version of the value latched therein isconnectable to the first node through a fourth switch.
 7. The senseamplifier of claim 1, wherein second latch circuit is used as part of aquick pass write verify
 8. A sense amplifier for a memory circuit,comprising: an intermediate circuit, including a first node selectivelyconnectable to one or more bit lines; a first latch circuit selectivelyconnectable to the first node; bit line selection circuitry connected tothe first node, whereby the first node can selectively be connected toone or more bit lines; a first switch whereby the first latch circuitcan be connected to a data bus; a second switch whereby the first nodeis selectively connectable to a first voltage supply level; and a thirdswitch whereby the first node is selectively connectable to an externalnode, wherein the path between the external node and the first node isformed of only n-type devices, and wherein when a value held in thefirst latch circuit is at a high value the first node is cut off fromthe first voltage supply level, and when the value held in the firstlatch circuit is at a low value the first node is cut off from theexternal node.
 9. The sense amplifier of claim 8, wherein the firstlatch circuit is connected between a second supply level and ground,wherein the second supply level of a higher level that the first supplylevel.
 10. The sense amplifier of claim 9, wherein the memory circuit onwhich the sense amplifier is formed further includes a charge pumpcircuit, where the second supply level is provided from the charge pumpcircuit.
 11. The sense amplifier of claim 8, wherein the first voltagesupply level is used for pre-charging for sensing.
 12. The senseamplifier of claim 8, wherein the voltage levels on the external nodehave a high level less than the first voltage supply level.
 13. Thesense amplifier of claim 12, wherein the voltage levels on the externalnode include a bit line value for verify lockout and a bit line value toenable programming.
 14. The sense amplifier of claim 8, furthercomprising: a pre-charge switch by which the first node can be connectedto the first voltage supply level for pre-charging of the first node fora sensing operation; a second latch circuit connectable to theintermediate circuit to have a value latched therein set according thelevel on the first node; and fourth and fifth switches whereby the valuelatched in the first and second latch circuits can respectively betransferred to a data bus, wherein, in a sensing operation, afterpre-charging the first node and prior to a subsequent pre-charging, thefirst and second data latch circuits can sequentially be connected tohave the value latched therein set according to the level of the firstnode.
 15. The sense amplifier of claim 14, wherein a node of the firstlatch that holds the value latched therein is connectable to the firstnode through a sixth switch.
 16. The sense amplifier of claim 15,wherein a node of the first latch that holds the inverted version of thevalue latched therein is connectable to the first node through a seventhswitch.
 17. The sense amplifier of claim 14, wherein second latchcircuit is used as part of a quick pass write verify
 18. The senseamplifier of claim 17, wherein the voltage levels on the external nodeinclude a bit line value to inhibit programming and a bit line value topartially inhibit programming.